Probabilistic Linear Discriminant Analysis for Acoustic Modelling
نویسنده
چکیده
In this letter, we propose a new acoustic modelling approach for automatic speech recognition based on probabilistic linear discriminant analysis (PLDA), which is used to model the state density function for the standard hidden Markov models (HMMs). Unlike the conventional Gaussian mixture models (GMMs) where the correlations are weakly modelled by using the diagonal covariance matrices, PLDA captures the correlations of feature vector in subspaces without vastly expanding the model. It also allows the usage of high dimensional feature input, and therefore is more flexible to make use of different type of acoustic features. We performed the preliminary experiments on the Switchboard corpus, and demonstrated the feasibility of this acoustic model.
منابع مشابه
Discrimination of Golab apple storage time using acoustic impulse response and LDA and QDA discriminant analysis techniques
ABSTRACT- Firmness is one of the most important quality indicators for apple fruits, which is highly correlated with the storage time. The acoustic impulse response technique is one of the most commonly used nondestructive detection methods for evaluating apple firmness. This paper presents a non-destructive method for classification of Iranian apple (Malus domestica Borkh. cv. Golab) according...
متن کاملFeature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models
Probabilistic linear discriminant analysis (PLDA) acoustic models extend Gaussian mixture models by factorizing the acoustic variability using state-dependent and observationdependent variables. This enables the use of higher dimensional acoustic features, and the capture of intra-frame feature correlations. In this paper, we investigate the estimation of speaker adaptive feature-space (constra...
متن کاملProbabilistic linear discriminant analysis with bottleneck features for speech recognition
We have recently proposed a new acoustic model based on probabilistic linear discriminant analysis (PLDA) which enjoys the flexibility of using higher dimensional acoustic features, and is more capable to capture the intra-frame feature correlations. In this paper, we investigate the use of bottleneck features obtained from a deep neural network (DNN) for the PLDA-based acoustic model. Experime...
متن کاملA probabilistic model of integration of acoustic cues in FV syllables
The interaction of consonantal and vocalic segments in FV syllables regarding identification of place of articulation of fricatives has been studied. A probabilistic model for integration of acoustic information in both segments is proposed. The model weights each segment’s contribution and integrates them in order to resemble listeners’ perception. First, the perceptual validity of the model h...
متن کاملUTD-CRSS Systems for 2012 NIST Speaker Recognition Evaluation The CRSS SRE Team
This document briefly describes the systems submitted by the Center for Robust Speech Systems (CRSS) from The University of Texas at Dallas (UTD) for the 2012 NIST Speaker Recognition Evaluation. We developed a state-of-the-art i-vector based speaker recognition system [1]. Probabilistic linear discriminant analysis (PLDA) [2] along with several other backends are used for channel/noise compens...
متن کامل